Data processing method and apparatus, and readable medium and electronic device

ABSTRACT

A data processing method and apparatus, and a readable medium and an electronic device are provided. The method includes: collecting a facial image of a user by means of a terminal, and taking the collected facial image as a first facial image; displaying the first facial image in a user display area of a terminal screen; in a case that the terminal screen displays a target object, collecting a sound signal of the user, and taking the collected sound signal as a target sound signal; and if the sound intensity of the target sound signal is within a first intensity range, switching a display state of the target object, and switching the first facial image in the user display area to a second facial image, where the second facial image is obtained on the basis of the first facial image.

The present disclosure is a continuation of International Application No. PCT/CN2021/099112, filed on Jun. 9, 2021, which claims priority to Chinese Patent Application No. 202010609854.5, titled “DATA PROCESSING METHOD AND APPARATUS, AND READABLE MEDIUM AND ELECTRONIC DEVICE”, filed with the China National Intellectual Property Administration on Jun. 29, 2020, both of which are incorporated herein by reference in their entireties.

TECHNICAL FIELD

The present disclosure relates to the field of data processing, in particular to a data processing method and apparatus, a readable medium and an electronic device.

BACKGROUND

With the development of sound control technology, more and more electronic devices are equipped with a sound control function. For example, a user can control the operation of a smart phone through sound. In a game scene, for instance, the user controls or switches the motion state of a character in the game through sound. However, such a control method is relatively simple, and cannot provide the user with a more advanced and richer use experience.

SUMMARY

This Summary is provided to introduce concepts in a simplified form that are described in detail in the Detailed Description of the Embodiments that follows. This Summary is not intended to identify key features or essential features of the claimed technical solution, nor is it intended to limit the scope of the claimed technical solution.

In the first aspect, a data processing method is provided according to the present disclosure. The method includes:

acquiring, by a terminal, a face image of a user as a first face image;

displaying the first face image in a user display area of a screen of the terminal;

acquiring a sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal; and

switching a display state of the target object and switching the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, where the second face image is obtained based on the first face image.

In the second aspect, a data processing apparatus is provided according to the present disclosure. The apparatus includes:

a first acquisition module configured to acquire a face image of a user by a terminal as a first face image;

a first display module configured to display the first face image in a user display area of a screen of the terminal;

a second acquisition module configured to acquire a sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal; and

a second display module configured to switch a display state of the target object and switch the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, where the second face image is obtained based on the first face image.

In the third aspect, a computer-readable medium storing a computer program thereon is provided according to the present disclosure. The program, when executed by a processing apparatus, performs steps of the method according to the first aspect of the present disclosure.

In the fourth aspect, an electronic device is provided according to the present disclosure. The electronic device includes:

a storage apparatus storing a computer program; and

a processing apparatus for executing the computer program in the storage apparatus to perform steps of the method according to the first aspect of the present disclosure.

In the above technical solution, the face image of the user is acquired by the terminal as the first face image. The first face image is displayed in the user display area of the screen of the terminal. When the target object is displayed on the screen of the terminal, the sound signal of the user is acquired as the target sound signal. If the sound intensity of the target sound signal is within the first intensity range, the display state of the target object is switched, and the first face image in the user display area is switched to the second face image. The second face image is obtained based on the first face image. In this way, the user can see his own face image in real time through the screen of the terminal, enhancing the sense of participation and immersion for the user operating the terminal. Moreover, the user can change the display content on the screen of the terminal through sound, without manual control by the user in the whole process, improving the experience of the user using the terminal.

Other features and advantages of the present disclosure will be described in detail in the section of Detailed Description of the Embodiments that follows.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flowchart showing a data processing method according to an embodiment of the present disclosure;

FIG. 2 is an exemplary schematic diagram of a user display area in the data processing method according to the present disclosure;

FIG. 3 is another exemplary schematic diagram of the user display area in the data processing method according to the present disclosure;

FIG. 4 is an exemplary flowchart showing a process of switching a display state of a target object and switching a first face image in the user display area to a second face image in a data processing method according to the present disclosure;

FIG. 5 is a block diagram showing a data processing apparatus according to an embodiment of the present disclosure; and

FIG. 6 is a structural schematic diagram of an electronic device suitable for implementing an embodiment of the present disclosure.

DETAILED DESCRIPTION OF THE EMBODIMENTS

The embodiments of the present disclosure are described in detail below with reference to the accompanying drawings. Although the drawings show some embodiments of the present disclosure, it should be understood that the present disclosure can be implemented in various forms and is not limited to the embodiments described herein. The embodiments are provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and the embodiments in the present disclosure are only illustrative of the disclosure, and are not intended to limit the protection scope of the present disclosure.

It should be understood that the steps of the method according to the embodiments of the present disclosure may be performed in different orders, and/or be performed in parallel. In addition, the method embodiments may include an additional step and/or an omitted step that is not shown herein. The scope of the present disclosure is not limited in this aspect.

The term “including” and variants thereof as used herein are open-ended, that is, “including but not limited to”. The term “based on” means “based at least in part on”. The term “one embodiment” means “at least one embodiment”. The term “another embodiment” means “at least one additional embodiment”. The term “some embodiments” means “at least some embodiments”. Definitions of other terms are provided in the following description.

It should be noted that the concepts such as “first” and “second” mentioned in the present disclosure are used to distinguish different apparatuses, modules or units, and are not used to limit a sequential order or interdependence of the functions performed by the apparatuses, modules or units.

It should be noted that the modifications such as “one” and “multiple” mentioned in the present disclosure are schematic rather than restrictive, and should be understood as “one or more” by those skilled in the art, unless explicitly indicated otherwise in the context.

The names of messages or information exchanged between multiple apparatuses in the embodiments of the present disclosure are only for illustrative purposes, and are not intended to limit the scope of the messages or information.

FIG. 1 is a flowchart showing a data processing method according to an embodiment of the present disclosure. As shown in FIG. 1, the method includes step 11 to step 14.

In step 11, a face image of a user is acquired by a terminal as a first face image.

In step 12, the first face image is displayed in a user display area of a screen of the terminal.

In step 13, a sound signal of the user is acquired as a target sound signal, in the case that a target object is displayed on the screen of the terminal.

In step 14, a display state of the target object is switched and the first face image in the user display area is switched to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, where the second face image is obtained based on the first face image.

In steps 11 and 12, in the step of acquiring the face image by the terminal, the image of the user may be acquired by an image capturing apparatus (for example, a camera) provided on the terminal, and the face image of the user is used as the first face image. For example, if the terminal is a mobile phone and the user holds the mobile phone and faces the screen of the mobile phone, the face image of the user may be acquired by a front camera of the mobile phone and used as the first face image. After the first face image is acquired, the first face image may be displayed in the user display area of the screen of the terminal. The operations of steps 11 and 12 are the operations of acquiring the face image of the user in real time and displaying it in the user display area of the screen of the terminal, to show the state of the user in real time. In this way, the experience of the user using the terminal is improved.

The sound made by the user in response to the target object may be acquired by the terminal. When the target object is displayed on the screen of the terminal, the sound signal of the user may be acquired. For example, the sound signal of the user may be acquired, as the target sound signal, by a sound acquisition device (e.g., a sound card) provided on the terminal.

After the target sound signal is acquired, the sound intensity of the target sound signal may be determined based on the target sound signal. For example, the sound intensity may be represented by decibels, volume, and the like. A case in which the sound intensity of the target sound signal is within the first intensity range indicates that the sound of the user meets the intensity requirement of the first intensity range. The first intensity range may be set according to actual needs; for example, the desired sound intensity range is used as the first intensity range. For instance, the first intensity range may be set to be greater than or equal to a first intensity threshold.
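
As a concrete illustration of this check, the following minimal Python sketch estimates intensity from a frame of raw audio samples and tests it against a threshold-style first intensity range. All numeric values, and the helper names `rms_decibels` and `in_first_intensity_range`, are illustrative assumptions rather than part of the disclosure.

```python
import math

# Hypothetical first intensity range: greater than or equal to 60 dB
# (the upper bound is left open, matching a threshold-style range).
FIRST_INTENSITY_MIN_DB = 60.0

def rms_decibels(samples, full_scale=32768.0):
    """Estimate the sound intensity of a frame of 16-bit PCM samples
    on a decibel-like scale (offset so typical speech is positive)."""
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    if rms == 0.0:
        return float("-inf")
    return 20.0 * math.log10(rms / full_scale) + 90.0

def in_first_intensity_range(samples):
    """True if the target sound signal meets the first intensity range."""
    return rms_decibels(samples) >= FIRST_INTENSITY_MIN_DB
```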

If the sound intensity of the target sound signal is within the first intensity range, the following operations may be performed:

switching the display state of the target object; and

switching the first face image in the user display area to the second face image.

The second face image is obtained based on the first face image. For example, the second face image may be an image obtained by superimposing material on the first face image; for instance, the second face image is generated by superimposing special effects (or stickers, expressions, etc.) on the first face image. In a special case, the second face image may be consistent with the first face image, that is, there is no obvious change in the image observed by the user.
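
For illustration, superimposing sticker material on the first face image can be sketched with the Pillow imaging library as below; the function name and the alpha-compositing approach are assumptions for this example, not the disclosure's prescribed implementation.

```python
from PIL import Image

def make_second_face_image(first_face_image, sticker, position=(0, 0)):
    """Generate the second face image by superimposing sticker material
    on a copy of the first face image at the given pixel position."""
    composed = first_face_image.convert("RGBA")
    composed.alpha_composite(sticker.convert("RGBA"), dest=position)
    return composed

# Usage (file names are placeholders):
# second = make_second_face_image(Image.open("face.png"), Image.open("smile.png"), (40, 10))
```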

Based on the operations above, when the intensity of the sound of the user is within the first intensity range, the display state of the target object and the face image in the user display area are changed accordingly. In this way, the user can see his own face image in real time through the screen of the terminal, enhancing the sense of participation and immersion for the user operating the terminal. Moreover, the display content on the screen of the terminal can be changed through the sound actually emitted by the user. In the whole process, no manual control by the user is required, improving the experience of the user using the terminal.

In order to enable those skilled in the art to better understand the present solution, the solution of the present disclosure will be described by way of example in combination with specific scenarios below. For the convenience of description, a currently common terminal game is taken as an example. It is assumed that, in a game of breaking a brick displayed on the screen, the user needs to break the brick on the screen through sound. In this scenario, the target object is the brick displayed on the screen. The first intensity range is an intensity range of the sound for breaking the brick, which is preset in the game. The general process of the present solution may be described as follows.

When the user starts the game, the face image of the user (i.e., the first face image) is acquired by the terminal in real time and is displayed in the user display area of the screen of the terminal. The user may directly observe his facial expression through the screen in real time.

When a brick appears on the screen of the terminal, the terminal starts to acquire the sound signal of the user (i.e., the target sound signal), and determines whether the sound intensity of the target sound signal is within the first intensity range. If the sound intensity of the target sound signal is within the first intensity range, the display state of the brick is switched. For example, the display state of the brick is switched to a broken state, where a brick image or animation in the broken state may be drawn in advance. Moreover, the first face image in the user display area is switched to the second face image. For example, an image obtained by adding a sticker to the first face image is displayed.

In the above scenario, various types of bricks may be preset, and each type of brick has a first intensity range corresponding to that type. When a type of brick is displayed on the screen of the terminal, the first intensity range corresponding to that type of brick is used.

That is, in the solution of the present disclosure, the target object may be set arbitrarily, and the first intensity range used to evaluate the target sound signal corresponds to the target object.

In the above technical solution, the face image of the user is acquired by the terminal as the first face image. The first face image is displayed in the user display area of the screen of the terminal. When the target object is displayed on the screen of the terminal, the sound signal of the user is acquired as the target sound signal. If the sound intensity of the target sound signal is within the first intensity range, the display state of the target object is switched, and the first face image in the user display area is switched to the second face image. The second face image is obtained based on the first face image. In this way, the user can see his own face image in real time through the screen of the terminal, enhancing the sense of participation and immersion for the user operating the terminal. Moreover, the user can change the display content on the screen of the terminal through sound, without manual control by the user in the whole process, improving the experience of the user using the terminal.

The solution of the present disclosure will be described below in detail based on the steps in FIG. 1.

In step 12, the first face image is displayed in the user display area of the screen of the terminal.

In a possible embodiment, the first face image may be displayed in a designated area (such as a designated square area, a designated circular area, etc.) on the top of the screen of the terminal. FIG. 2 is an exemplary schematic diagram of a user display area. The outer box represents the screen of the terminal, and the acquired face image of the user is displayed in a part F1 of the user display area.

Optionally, a body image may also be displayed in the user display area. Generally, the body image is connected with the first face image to form a complete image of the user. Accordingly, the method provided in the present disclosure may include the following steps:

determining position information of the user according to the first face image;

determining a body image corresponding to the first face image according to the position information and pre-stored image material; and

displaying the body image in the user display area.

The position information of the user is used to indicate an offset direction and offset angle of the user relative to the terminal, and may be determined from the first face image. For example, a centerline of the face of the user may be determined from the first face image, and the centerline of the face of the user is compared with a centerline of a viewfinder of the terminal, to determine the offset direction and offset angle of the user relative to the terminal. As shown in FIG. 2, the user display area includes an area F1 for displaying the first face image and an area F2 for displaying the body image. P1 is the centerline of the viewfinder. FIG. 2 shows an image in the case of no offset of the first face image relative to the terminal.

In FIG. 2, the centerline of the face of the user is consistent with the centerline of the viewfinder. Therefore, the body image displayed in the user display area in FIG. 2 also has no offset. As shown in FIG. 3, P2 is the centerline of the face of the user determined based on the first face image in area F1, and P1 is the centerline of the viewfinder. Based on the centerline P2, the position information of the user is determined as an offset angle α to the right (clockwise) of the user relative to the terminal. Accordingly, the body image displayed in the user display area in FIG. 3 deviates at the same offset angle in the same offset direction.

After the position information of the user is determined, the body image corresponding to the first face image is determined according to the position information and the pre-stored image material, and is displayed in the user display area of the screen of the terminal. For example, the image material may be an original body image without an offset, drawn in advance. The original body image is rotated by the corresponding offset angle in the corresponding offset direction based on the offset direction and offset angle in the position information of the user, to obtain the body image for display. For example, if the user holds the mobile phone and tilts his head to the left, the body image displayed in the user display area is also tilted to the left. In addition, the position information of the user may be not only two-dimensional but also three-dimensional. For example, in the case that the position information of the user is three-dimensional, a side body image may be displayed in the user display area when the user shows a side face to the terminal. Thus, the actual actions of the user are well reflected in the screen display, enhancing the sense of immersion for the user and improving the user experience.
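
A minimal Python sketch of the two-dimensional case follows. It assumes face landmarks (here, two eye positions) are available from some upstream detector, and it uses the tilt of the inter-eye line as a stand-in for the face centerline comparison; both assumptions are illustrative, not taken from the disclosure.

```python
import math
from PIL import Image

def face_offset_angle(left_eye, right_eye):
    """Estimate the user's offset angle in degrees (positive = clockwise)
    from the tilt of the line between the two eyes; the face centerline is
    perpendicular to this line, so both share the same tilt angle."""
    dx = right_eye[0] - left_eye[0]
    dy = right_eye[1] - left_eye[1]
    return math.degrees(math.atan2(dy, dx))

def body_image_for(original_body, offset_angle_deg):
    """Rotate the pre-stored, offset-free body image by the offset angle.
    Pillow rotates counterclockwise for positive angles, hence the sign
    flip to follow a clockwise user offset."""
    return original_body.rotate(-offset_angle_deg, resample=Image.BICUBIC)
```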

Step 13 is described in detail below, in which the sound signal of the user is acquired as the target sound signal in the case that the target object is displayed on the screen of the terminal.

In a possible embodiment, the target object may be statically displayed on the screen of the terminal, that is, the target object is displayed at a fixed position on the screen of the terminal until it disappears from the screen. In this embodiment, step 13 may include: in a preset period after the target object starts to be displayed on the screen of the terminal, acquiring the sound signal of the user as the target sound signal.

The preset period after the target object starts to be displayed on the screen of the terminal may be the entire period in which the target object is displayed on the screen of the terminal (that is, the entire process of displaying the target object on the screen of the terminal), or a part of that entire period. For example, the acquisition starts when the target object starts to be displayed on the screen of the terminal and ends when the target object disappears from the screen of the terminal. The sound signal acquired in this process is the target sound signal.

In another possible embodiment, the target object may be dynamically displayed on the screen of the terminal. The target object may move on the screen of the terminal at a first speed along a preset path in the display process. For example, the target object appears on the left side of the screen of the terminal, moves horizontally to the right side of the screen at the first speed, and then disappears.

In this embodiment, step 13 may include the following step: when the target object is located in a designated area of the screen of the terminal, acquiring the sound signal of the user as the target sound signal.

For example, an area of the screen of the terminal (for example, a circular area, a square area, and the like) may be preselected as the designated area. When the target object is displayed and moved on the screen of the terminal, the position of the target object is known. Thus, when the target object enters the designated area, the sound signal of the user is acquired; and when the target object is not in the designated area, the sound signal of the user is not acquired.
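
The gating logic can be sketched as follows; the rectangular area representation and the capture callbacks are assumptions introduced for this example.

```python
def in_designated_area(obj_pos, area):
    """True if the target object's position (x, y) lies within the designated
    rectangular area given as (left, top, right, bottom) in screen pixels."""
    x, y = obj_pos
    left, top, right, bottom = area
    return left <= x <= right and top <= y <= bottom

def update_sound_capture(obj_pos, area, capturing, start_capture, stop_capture):
    """Acquire the user's sound only while the moving target object is inside
    the designated area; the start/stop callbacks stand in for the terminal's
    sound acquisition device. Returns the new capturing state."""
    inside = in_designated_area(obj_pos, area)
    if inside and not capturing:
        start_capture()
    elif not inside and capturing:
        stop_capture()
    return inside
```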

In addition, when the target object moves dynamically in the manner described above, the method provided in the present disclosure may also include the following steps:

performing a sound content recognition on the target sound signal to determine a sound content corresponding to the target sound signal; and

determining a second speed corresponding to a preset keyword, in the case that the sound content corresponding to the target sound signal includes the preset keyword, and switching a moving speed of the target object on the screen of the terminal from the first speed to the second speed.

The second speed corresponding to the preset keyword may be obtained through a pre-stored correspondence between keywords and speeds. For example, the preset keyword may be “acceleration” or “deceleration”. In the pre-stored correspondence, the keyword “acceleration” may correspond to a speed A1 which is faster than the first speed, and the keyword “deceleration” may correspond to a speed A2 which is slower than the first speed. Thus, if the user says “acceleration”, the preset keyword “acceleration” may be determined by the sound content recognition on the target sound signal, and the second speed may then be determined as A1. Thus, the moving speed of the target object on the screen of the terminal may be switched from the first speed to the second speed A1. From the user's perspective, after the user says “acceleration”, the target object moves faster on the screen of the terminal.
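
A table-driven version of this lookup might look like the following sketch; the numeric speeds are invented for illustration, and the disclosure only specifies that A1 is faster and A2 is slower than the first speed.

```python
FIRST_SPEED = 200.0  # illustrative moving speed, in pixels per second

# Pre-stored correspondence between preset keywords and speeds.
KEYWORD_TO_SPEED = {
    "acceleration": 400.0,  # speed A1, faster than the first speed
    "deceleration": 100.0,  # speed A2, slower than the first speed
}

def speed_after_recognition(sound_content, current_speed=FIRST_SPEED):
    """Switch the target object's moving speed to the second speed when the
    recognized sound content includes a preset keyword; otherwise keep it."""
    for keyword, second_speed in KEYWORD_TO_SPEED.items():
        if keyword in sound_content:
            return second_speed
    return current_speed
```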

With the above method and in combination with the sound content recognition technology, the user can control the moving speed of the target object through sound without manual setting or operation by the user, improving the user experience.

Next, an example will be given for step 14.

In a possible embodiment, step 14 may include the following steps, as shown in FIG. 4.

In step 41, a display state corresponding to the first intensity range is determined, in response to the sound intensity of the target sound signal within the first intensity range; and the target object is displayed in the display state corresponding to the first intensity range.

For example, a correspondence between the first intensity range and the display state may be pre-stored. In this way, if the sound intensity of the target sound signal is within the first intensity range, the display state corresponding to the first intensity range may be determined, and the target object may be displayed in the display state corresponding to the first intensity range.

In a possible embodiment, the first intensity range may be segmented into several intensity intervals, and the respective intensity intervals correspond to different display states. In this way, if the sound intensity of the target sound signal is within the first intensity range, the intensity interval in which the sound intensity of the target sound signal falls may be further determined, to determine the corresponding display state. Here, the display state may reflect the intensity interval of the target sound signal in the first intensity range, and thus the display state is more detailed.

For example, the first intensity range may be divided, at an intermediate value, into a first intensity interval with low intensity and a second intensity interval with high intensity. If the sound intensity of the target sound signal is within the first intensity interval, it indicates that the intensity of the sound of the user is within the first intensity range and is low. If the sound intensity of the target sound signal is within the second intensity interval, it indicates that the intensity of the sound of the user is within the first intensity range and is high, which may cause noise. Therefore, different display states (i.e., a first display state and a second display state) may be set for these two intensity intervals. For example, in the example of breaking the brick described above, the first display state may correspond to a state in which the brick is broken into four pieces, and the second display state may correspond to a state in which the brick is broken into eight pieces. That is, if the sound of the user is within the first intensity interval, the brick is broken into four pieces; and if the sound of the user is within the second intensity interval, the brick is broken into eight pieces.
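
In code, this interval-to-state mapping reduces to a couple of comparisons, as in the sketch below; the decibel thresholds and state names are illustrative assumptions.

```python
RANGE_MIN_DB = 60.0     # lower bound of the first intensity range (assumed)
INTERMEDIATE_DB = 75.0  # intermediate value splitting the two intervals (assumed)

def display_state_for(db):
    """Map a sound intensity to a display state of the target object,
    following the brick example: the low interval breaks the brick into
    four pieces, the high interval into eight."""
    if db < RANGE_MIN_DB:
        return None                       # outside the first intensity range
    if db < INTERMEDIATE_DB:
        return "broken_into_four_pieces"  # first display state
    return "broken_into_eight_pieces"     # second display state
```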

In the way above, the switched display state of the target object may reflect the intensity interval of the user's sound intensity in the first intensity range, which is convenient for the user to make a subsequent adjustment to the sound, does not affect the use of the terminal, and can prevent the problem of causing noise.

In step 42, the image material corresponding to the first intensity range is determined, in response to the sound intensity of the target sound signal within the first intensity range.

In step 43, the second face image is generated according to the first face image and the image material corresponding to the first intensity range, and the first face image is replaced with the second face image.

For example, a correspondence between the first intensity range and the image material may be pre-stored. In this way, if the sound intensity of the target sound signal is within the first intensity range, the image material corresponding to the first intensity range may be determined, and the second face image may be generated according to the first face image and the image material corresponding to the first intensity range.

In a possible embodiment, the first intensity range may also be segmented into several intensity intervals, and the respective intensity intervals correspond to different image materials. In this way, if the sound intensity of the target sound signal is within the first intensity range, the intensity interval in which the sound intensity of the target sound signal falls may be further determined, to determine the corresponding image material. Here, the determined image material may reflect the intensity interval of the target sound signal in the first intensity range, and thus the presentation of the image material is more detailed.

For example, the first intensity range may be divided, at an intermediate value, into a first intensity interval with low intensity and a second intensity interval with high intensity. If the sound intensity of the target sound signal is within the first intensity interval, it indicates that the intensity of the sound of the user is within the first intensity range and is low. If the sound intensity of the target sound signal is within the second intensity interval, it indicates that the intensity of the sound of the user is within the first intensity range and is high, which may cause noise. Therefore, different image materials (i.e., first image material and second image material) may be set for these two intensity intervals. For example, in the example of breaking the brick described above, the first image material may correspond to a smile expression sticker, and the second image material may correspond to a surprise expression sticker. That is, if the sound of the user is within the first intensity interval, the smile expression sticker is added to the face image of the user; and if the sound of the user is within the second intensity interval, the surprise expression sticker is added to the face image of the user.
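
Steps 42 and 43 can then be sketched by pairing an interval-to-material lookup with the compositing helper shown earlier; the sticker file names and thresholds are placeholder assumptions.

```python
def material_for(db, range_min_db=60.0, intermediate_db=75.0):
    """Pick the sticker material for the second face image according to the
    intensity interval of the target sound signal (thresholds are assumed)."""
    if db < range_min_db:
        return None                 # not within the first intensity range
    if db < intermediate_db:
        return "smile_sticker.png"  # first image material
    return "surprise_sticker.png"   # second image material

# Usage, reusing make_second_face_image from the earlier sketch:
# sticker_path = material_for(rms_decibels(samples))
# if sticker_path is not None:
#     second = make_second_face_image(first_face, Image.open(sticker_path))
```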

In the way above, the switched face image (i.e., the second face image) may reflect the intensity interval of the sound intensity of the user in the first intensity range, which is convenient for the user to make a subsequent adjustment to the sound, does not affect the use of the terminal, and can prevent the problem of causing noise.

It should be noted that the execution order of step 41 and steps 42 to 43 above is not limited. Steps 41 to 43 may be performed sequentially in the order shown in FIG. 4. Alternatively, steps 42 to 43 may be performed first, and then step 41 is performed. Alternatively, step 41 and steps 42 to 43 may be performed simultaneously.

Optionally, the method according to the present disclosure may further include: keeping the display state of the target object unchanged and switching the first face image in the user display area to a third face image, in response to the sound intensity of the target sound signal within a second intensity range.

For example, the second intensity range may be set to the intensity range obtained by excluding the first intensity range. For example, if the first intensity range is set to be greater than or equal to the first intensity threshold, the second intensity range may be the intensity range from zero to the first intensity threshold.

In addition, the face image may also be switched accordingly. The third face image is obtained based on the first face image. For example, a manner for determining the third face image may include: determining an image material corresponding to the second intensity range, if the sound intensity of the target sound signal is within the second intensity range; and generating the third face image according to the first face image and the image material corresponding to the second intensity range, and replacing the first face image with the third face image.

For example, a correspondence between the second intensity range and the image material may be pre-stored. In this way, if the sound intensity of the target sound signal is within the second intensity range, the image material corresponding to the second intensity range may be determined, and the third face image may be generated according to the first face image and the image material corresponding to the second intensity range. For example, in the example of breaking the brick described above, the image material corresponding to the second intensity range may correspond to a crying expression sticker. That is, if the sound of the user cannot break the brick, the crying expression sticker is added to the face image of the user. In this way, it is convenient for the user to make a subsequent adjustment to the sound.

In an embodiment, the method provided in the present disclosure may further include: determining a first action value according to the sound intensity of the target sound signal, in response to obtaining an instruction for determining a user action value; and determining the user action value according to the first action value.

The first action value is used to characterize a matching degree between the sound intensity of the target sound signal and the standard intensity demanded by the target object.

In other words, the closer the intensity of the sound of the user is to the standard intensity demanded by the target object, the larger the first action value is. For example, a correspondence between the sound intensity and the action value may be preset for the target object, so that the first action value may be directly determined according to the sound intensity of the target sound signal.
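
One simple realization of such a correspondence is a linear falloff around the standard intensity, as sketched below; the tolerance, scale, and rounding are all assumptions for illustration.

```python
def first_action_value(db, standard_db, tolerance_db=20.0, max_value=100):
    """Map the sound intensity of the target sound signal to a first action
    value: the closer the intensity is to the target object's standard
    intensity, the larger the value, reaching 0 at the tolerance edge."""
    closeness = max(0.0, 1.0 - abs(db - standard_db) / tolerance_db)
    return round(max_value * closeness)
```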

After the first action value is determined, the user action value may be further determined. For example, the first action value may be directly used as the user action value. In the example of breaking the brick described above, the user action value corresponds to the game score obtained by the user in a single brick-breaking process.

In an embodiment, the method according to the present disclosure may further include: determining a second action value according to the first face image, in response to obtaining an instruction for determining the user action value; and determining the user action value according to the second action value.

The second action value is used to characterize a matching degree between an expression feature of the first face image and a standard feature demanded by the target object. That is, the closer the expression of the user is to the standard feature demanded by the target object, the larger the second action value is.

After the second action value is determined, the user action value may be further determined. For example, the second action value may be used as the user action value.

In an embodiment, the user action value may be determined in combination with the sound and the facial expression of the user, that is, the user action value is jointly determined from the first action value and the second action value. For example, after the first action value and the second action value are determined, the user action value may be determined according to the first action value and the second action value. For example, calculation weights may be preset for the first action value and the second action value, and the user action value may be determined according to the first action value, the second action value and their respective weights.
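
As a sketch, the weighted combination could read as follows; the 0.6/0.4 weights are invented for the example, since the disclosure leaves the preset weights open.

```python
def user_action_value(first_value, second_value, w_sound=0.6, w_face=0.4):
    """Combine the sound-based first action value and the expression-based
    second action value using preset calculation weights."""
    return round(w_sound * first_value + w_face * second_value)
```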

After the user action value is determined, the user action value may be further displayed on the screen of the terminal to notify the user. For example, in the example of breaking the brick described above, displaying the user action value means displaying the score obtained by the user for breaking the brick this time.

FIG. 5 is a block diagram showing a data processing apparatus according to an embodiment of the present disclosure. As shown in FIG. 5, the data processing apparatus 50 may include a first acquisition module 51, a first display module 52, a second acquisition module 53, and a second display module 54.

The first acquisition module 51 is configured to acquire a face image of a user by a terminal as a first face image.

The first display module 52 is configured to display the first face image in a user display area of a screen of the terminal.

The second acquisition module 53 is configured to acquire a sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal.

The second display module 54 is configured to switch a display state of the target object and switch the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, where the second face image is obtained based on the first face image.

Optionally, a body image is displayed in the user display area.

The apparatus 50 also includes a first determination module, a second determination module, and a third determination module.

The first determination module is configured to determine position information of the user according to the first face image, where the position information is used to indicate an offset direction and an offset angle of the user relative to the terminal.

The second determination module is configured to determine the body image corresponding to the first face image, according to the position information and pre-stored image material.

The third determination module is configured to display the body image in the user display area.

In an embodiment, the target object moves on the screen of the terminal at a first speed along a preset path, in a process of displaying the target object on the screen of the terminal.

In an embodiment, the second acquisition module 53 is configured to acquire the sound signal of the user as the target sound signal, in a case that the target object is located in a designated area of the screen of the terminal.

In an embodiment, the apparatus 50 further includes a fourth determination module and a fifth determination module.

The fourth determination module is configured to perform a sound content recognition on the target sound signal, to determine a sound content corresponding to the target sound signal.

The fifth determination module is configured to determine a second speed corresponding to a preset keyword, in the case that the sound content corresponding to the target sound signal includes the preset keyword, and switch a moving speed of the target object on the screen of the terminal from the first speed to the second speed.

In an embodiment, the second display module 54 includes a first determination sub-module, a second determination sub-module and an image generation sub-module.

The first determination sub-module is configured to determine a display state corresponding to the first intensity range in response to the sound intensity of the target sound signal within the first intensity range, and display the target object in the display state corresponding to the first intensity range.

The second determination sub-module is configured to determine image material corresponding to the first intensity range, in response to the sound intensity of the target sound signal within the first intensity range.

The image generation sub-module is configured to generate the second face image according to the first face image and the image material corresponding to the first intensity range, and replace the first face image with the second face image.

In an embodiment, the apparatus 50 further includes a third display module configured to keep a display state of the target object unchanged in response to the sound intensity of the target sound signal within a second intensity range, and switch the first face image in the user display area to a third face image, where the third face image is obtained based on the first face image.

In an embodiment, the apparatus 50 further includes a sixth determination module and a seventh determination module.

The sixth determination module is configured to determine a first action value according to the sound intensity of the target sound signal, in response to obtaining an instruction for determining a user action value, where the first action value is used to characterize a matching degree between the sound intensity of the target sound signal and a standard intensity demanded by the target object.

The seventh determination module is configured to determine the user action value according to the first action value.

In an embodiment, the apparatus 50 further includes an eighth determination module and a ninth determination module.

The eighth determination module is configured to determine a second action value according to the first face image, in response to obtaining an instruction for determining a user action value, where the second action value is used to characterize a matching degree between an expression feature of the first face image and a standard feature demanded by the target object.

The ninth determination module is configured to determine the user action value according to the second action value.

For the apparatus according to the embodiments described above, the manners in which the modules perform operations have been described in detail in the embodiments of the method, and will not be described in detail herein.

Reference is made to FIG. 6, which shows a schematic structural diagram of an electronic device 600 suitable for implementing the embodiments of the present disclosure. Terminal devices according to the embodiments of the present disclosure may include, but are not limited to, mobile terminals, such as mobile phones, laptops, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet PCs), PMPs (portable multimedia players) and vehicle-mounted terminals (such as in-vehicle navigation terminals), and fixed terminals such as digital TVs, desktop computers, and the like. The electronic device shown in FIG. 6 is only exemplary, and should not indicate any limitation to the function and application scope of the embodiments of the present disclosure.

As shown in FIG. 6, the electronic device 600 may include a processing apparatus 601, such as a central processor or a graphics processor, which may execute various proper operations and processing based on a program stored in a Read Only Memory (ROM) 602 or a program loaded from a storage apparatus 608 into a Random Access Memory (RAM) 603. The RAM 603 is further configured to store various programs and data demanded by the electronic device 600. The processing apparatus 601, the ROM 602 and the RAM 603 are connected to each other through a bus 604. An input/output (I/O) interface 605 is also connected to the bus 604.

Generally, the I/O interface 605 may be connected to: an input apparatus 606, such as a touch screen, a touch panel, a keyboard, a mouse, a camera, a microphone, an accelerometer, and a gyroscope; an output apparatus 607, such as a liquid crystal display (LCD), a speaker, and a vibrator; a storage apparatus 608, such as a magnetic tape and a hard disk; and a communication apparatus 609. The communication apparatus 609 enables wireless or wired communication between the electronic device 600 and other devices for data exchanging. Although FIG. 6 shows an electronic device 600 having various apparatuses, it should be understood that not all of the illustrated apparatuses are required to be implemented or included. Alternatively, more or fewer apparatuses may be implemented or included.

Particularly, according to the embodiments of the present disclosure, the process described above in conjunction with the flowcharts may be implemented as a computer software program. For example, a computer program product is further provided as an embodiment in the present disclosure, including a computer program carried on a non-transitory computer readable medium. The computer program includes program code for performing the method shown in the flowchart. In the embodiment, the computer program may be downloaded and installed from the network via the communication apparatus 609, or installed from the storage apparatus 608, or installed from the ROM 602. When the computer program is executed by the processing apparatus 601, the functions defined in the method according to the embodiment of the present disclosure are performed.

It should be noted that the computer readable medium mentioned in the present disclosure may be a computer readable signal medium or a computer readable storage medium or any combination thereof. The computer readable storage medium may be, but is not limited to, a system, an apparatus, or a device in an electronic, magnetic, optical, electromagnetic, infrared, or semi-conductive form, or any combination thereof. More specific examples of the computer readable storage medium may be, but are not limited to, an electrical connection with one or more wires, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any combination thereof. In the present disclosure, the computer readable storage medium may be any tangible medium containing or storing a program, and the program may be used by or in combination with an instruction execution system, apparatus, or device. In the present disclosure, the computer readable signal medium may be a data signal transmitted in a baseband or transmitted as a part of a carrier wave and carrying computer readable program codes. The transmitted data signal may be in various forms, including but not limited to an electromagnetic signal, an optical signal, or any proper combination thereof. The computer readable signal medium may be any computer readable medium other than the computer readable storage medium, and can send, propagate or transmit programs to be used by or in combination with an instruction execution system, apparatus or device. The program codes stored in the computer readable medium may be transmitted via any proper medium, including but not limited to: a wire, an optical cable, RF (radio frequency) and the like, or any proper combination thereof.

In some embodiments, the client device can communicate using any currently known or future developed network protocol such as HyperText Transfer Protocol (HTTP), and can be interconnected with any form or medium of digital data communication (e.g., a communication network). Examples of a communication network include a local area network (“LAN”), a wide area network (“WAN”), an internetwork (for example, the Internet), and a peer-to-peer network (for example, an ad hoc peer-to-peer network), as well as any currently known or future developed network.

The computer readable medium may be incorporated in the electronic device, or may exist alone without being assembled into the electronic device.

The computer-readable medium carries one or more programs. The one or more programs, when executed by the electronic device, cause the electronic device to: acquire the face image of the user by the terminal as the first face image; display the first face image in the user display area of the screen of the terminal; acquire the sound signal of the user as the target sound signal, in a case that a target object is displayed on the screen of the terminal; and switch the display state of the target object and switch the first face image in the user display area to a second face image, in response to the sound intensity of the target sound signal within the first intensity range, where the second face image is obtained based on the first face image.

The computer program code for performing the operations disclosed in the present disclosure may be written in one or more programming languages or combinations thereof. The programming languages include, but are not limited to, object-oriented programming languages, such as Java, Smalltalk, and C++, and conventional procedural programming languages, such as the C language or similar programming languages. The program code may be executed entirely on a user computer, partially on the user computer, as a standalone software package, partially on the user computer and partially on a remote computer, or entirely on the remote computer or a server. In a case involving a remote computer, the remote computer may be connected to the user computer through any network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet provided by an Internet service provider).

Flowcharts and block diagrams in the drawings illustrate the architecture, functions and operations that can be implemented by a system, a method and a computer program product according to various embodiments of the present disclosure. In this regard, each block in the flowchart or the block diagram may represent a module, a program segment, or a part of code. The module, the program segment, or the part of code includes one or more executable instructions used for implementing specified logic functions. It should be noted that, in some alternative implementations, the functions marked in blocks may be performed in an order different from the order shown in the drawings. For example, two blocks shown in succession may actually be executed in parallel, or sometimes may be executed in a reverse order, which depends on the functions involved. It should also be noted that each block in the block diagram and/or flowchart and a combination of blocks in the block diagram and/or flowchart may be implemented by a dedicated hardware-based system that performs specified functions or actions, or may be realized by a combination of dedicated hardware and computer instructions.

The modules mentioned in the embodiments of the present disclosure may be implemented by means of software, or otherwise by means of hardware. The name of the module does not constitute a restriction on the module itself in some cases. For example, the first acquisition module may also be described as “a module for acquiring the face image of the user by the terminal”.

The functions described herein above may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that can be used include: Field Programmable Gate Array (FPGA), Application Specific Integrated Circuit (ASIC), Application Specific Standard Product (ASSP), System on Chip (SOC), Complex Programmable Logic Device (CPLD) and the like.

In the context of the present disclosure, the machine readable medium may be a tangible medium that may contain or store a program, and the program may be used by or in connection with an instruction execution system, apparatus or device. The machine readable medium may be a machine readable signal medium or a machine readable storage medium. The machine readable medium may include, but is not limited to, a system, an apparatus, or a device in an electronic, magnetic, optical, electromagnetic, infrared, or semi-conductive form, or any suitable combination thereof. More specific examples of the machine readable storage medium may include an electrical connection with one or more wires, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination thereof.

According to one or more embodiments of the present disclosure, a data processing method is provided. The method includes:

acquiring, by a terminal, a face image of a user as a first face image;

displaying the first face image in a user display area of a screen of the terminal;

acquiring a sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal; and

switching a display state of the target object and switching the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, where the second face image is obtained based on the first face image.

According to one or more embodiments of the present disclosure, in the data processing method, a body image is displayed in the user display area.

The data processing method further includes:

determining position information of the user according to the first face image, where the position information is used to indicate an offset direction and offset angle of the user relative to the terminal;

determining the body image corresponding to the first face image, according to the position information and pre-stored image material; and

displaying the body image in the user display area.

According to one or more embodiments of the present disclosure, in the data processing method, the target object moves on the screen of the terminal at a first speed along a preset path, in a process of displaying the target object on the screen of the terminal.

According to one or more embodiments of the present disclosure, in the data processing method, the acquiring the sound signal of the user as a target sound signal in a case that a target object is displayed on the screen of the terminal includes:

acquiring the sound signal of the user as the target sound signal, in a case that the target object is located in a designated area of the screen of the terminal.

According to one or more embodiments of the present disclosure, the data processing method further includes:

performing a sound content recognition on the target sound signal to determine a sound content corresponding to the target sound signal; and

determining a second speed corresponding to a preset keyword, in the case that the sound content corresponding to the target sound signal includes the preset keyword, and switching a moving speed of the target object on the screen of the terminal from the first speed to the second speed.

According to one or more embodiments of the present disclosure, in the data processing method, the switching a display state of the target object and switching the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, includes:

determining a display state corresponding to the first intensity range, in response to the sound intensity of the target sound signal within the first intensity range, and displaying the target object in the display state corresponding to the first intensity range;

determining image material corresponding to the first intensity range in response to the sound intensity of the target sound signal within the first intensity range; and

generating the second face image according to the first face image and the image material corresponding to the first intensity range, and replacing the first face image with the second face image.

According to one or more embodiments of the present disclosure, the data processing method further includes:

keeping a display state of the target object unchanged in response to the sound intensity of the target sound signal within a second intensity range, and switching the first face image in the user display area to a third face image, where the third face image is obtained based on the first face image.

According to one or more embodiments of the present disclosure, the data processing method further includes:

determining a first action value according to the sound intensity of the target sound signal, in response to obtaining an instruction for determining a user action value, where the first action value is used to characterize a matching degree between the sound intensity of the target sound signal and a standard intensity demanded by the target object; and

determining the user action value according to the first action value.

According to one or more embodiments of the present disclosure, the data processing method further includes:

determining a second action value according to the first face image, in response to obtaining an instruction for determining a user action value, where the second action value is used to characterize a matching degree between an expression feature of the first face image and a standard feature demanded by the target object; and

determining the user action value according to the second action value.

According to one or more embodiments of the present disclosure, a data processing apparatus is provided. The apparatus includes:

a first acquisition module configured to acquire a face image of a user by a terminal as a first face image;

a first display module configured to display the first face image in a user display area of a screen of the terminal;

a second acquisition module configured to acquire a sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal; and

a second display module configured to switch a display state of the target object and switch the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, where the second face image is obtained based on the first face image.

According to one or more embodiments of the present disclosure, a computer-readable medium storing a computer program thereon is provided. The program, when executed by a processing apparatus, causes the processing apparatus to perform the steps of the method according to any of the embodiments of the present disclosure.

According to one or more embodiments of the present disclosure, an electronic device is provided. The electronic device includes:

a storage apparatus storing a computer program thereon; and

a processing apparatus for executing the computer program in the storage apparatus to perform the steps of the method according to any of the embodiments of the present disclosure.

The above description includes merely preferred embodiments of the present disclosure and explanations of the technical principles used. Those skilled in the art should understand that the scope of the present disclosure is not limited to technical solutions formed by the combination of the technical features described above, but also covers other technical solutions formed by any combination of the above technical features or their equivalent features without departing from the concept of the present disclosure. For example, a technical solution formed by interchanging the above features with technical features having similar functions disclosed (but not limited to those disclosed) in the present disclosure also falls within the scope of the present disclosure.

It should be noted that although the above operations are described in a specific order, it should not be understood that these operations are required to be performed in that specific order or in a sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Although multiple implementation details are included in the above descriptions, the details should not be interpreted as limitations on the scope of the present disclosure. Some features described in separate embodiments may be implemented in combination in a single embodiment. Conversely, features described in a single embodiment may be implemented in multiple embodiments individually or in any suitable sub-combination.

Although the subject matter has been described in language specific to structural features and/or logical actions of the method, it should be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or actions described above. The specific features and actions described above are merely exemplary forms of implementing the claims. For the device according to the embodiments described above, the manners in which the modules perform operations have been described in detail in the embodiments of the method, and are not described in detail herein.

1. A data processing method, comprising: acquiring, by a terminal, a face image of a user as a first face image; displaying the first face image in a user display area of a screen of the terminal; acquiring a sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal; and switching a display state of the target object and switching the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, wherein the second face image is obtained based on the first face image.

2. The data processing method according to claim 1, wherein the first intensity range corresponds to the target object.

3. The data processing method according to claim 1, wherein a body image is displayed in the user display area; and the data processing method further comprises: determining position information of the user according to the first face image, wherein the position information is used to indicate an offset direction and an offset angle of the user relative to the terminal; determining the body image corresponding to the first face image according to the position information and pre-stored image material; and displaying the body image in the user display area.

4. The data processing method according to claim 3, wherein the position information is two-dimensional information or three-dimensional information.

5. The data processing method according to claim 1, wherein the target object is statically displayed on the screen of the terminal.

6. The data processing method according to claim 5, wherein the acquiring the sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal, comprises: acquiring the sound signal of the user as the target sound signal, in a preset period after the target object starts to be displayed on the screen of the terminal.

7. The data processing method according to claim 1, wherein the target object is dynamically displayed on the screen of the terminal.

8. The data processing method according to claim 7, wherein the target object moves on the screen of the terminal at a first speed along a preset path, in a process of displaying the target object on the screen of the terminal.

9. The data processing method according to claim 8, wherein the acquiring the sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal, comprises: acquiring the sound signal of the user as the target sound signal, in a case that the target object is located in a designated area of the screen of the terminal.

10. The data processing method according to claim 7, further comprising: performing a sound content recognition on the target sound signal, to determine a sound content corresponding to the target sound signal; determining a second speed corresponding to a preset keyword, in a case that the sound content corresponding to the target sound signal comprises the preset keyword; and controlling a moving speed of the target object on the screen of the terminal to switch from the first speed to the second speed.

11. The data processing method according to claim 1, wherein the switching a display state of the target object and switching the first face image in the user display area to a second face image, in response to the sound intensity of the target sound signal within the first intensity range, comprises: determining a display state corresponding to the first intensity range, in response to the sound intensity of the target sound signal within the first intensity range; and displaying the target object in the display state corresponding to the first intensity range; and determining image material corresponding to the first intensity range, in response to the sound intensity of the target sound signal within the first intensity range; generating the second face image according to the first face image and the image material corresponding to the first intensity range, and replacing the first face image with the second face image.

12. The data processing method according to claim 11, wherein the first intensity range comprises a first intensity interval and a second intensity interval, the first intensity interval corresponds to a first display state and first image material, and the second intensity interval corresponds to a second display state and second image material, and wherein the determining a display state corresponding to the first intensity range, in response to the sound intensity of the target sound signal within the first intensity range; and displaying the target object in the display state corresponding to the first intensity range, comprises: determining the first display state corresponding to the first intensity interval, in response to the sound intensity of the target sound signal within the first intensity interval; and displaying the target object in the first display state; and determining the second display state corresponding to the second intensity interval, in response to the sound intensity of the target sound signal within the second intensity interval; and displaying the target object in the second display state; and the determining the image material corresponding to the first intensity range, in response to the sound intensity of the target sound signal within the first intensity range; generating the second face image according to the first face image and the image material corresponding to the first intensity range; and replacing the first face image with the second face image, comprises: determining the first image material corresponding to the first intensity interval, in response to the sound intensity of the target sound signal within the first intensity interval; generating the second face image according to the first face image and the first image material; and replacing the first face image with the second face image; and determining the second image material corresponding to the second intensity interval, in response to the sound intensity of the target sound signal within the second intensity interval; generating the second face image according to the first face image and the second image material; and replacing the first face image with the second face image.

13. The data processing method according to claim 1, further comprising: keeping the display state of the target object unchanged in response to the sound intensity of the target sound signal within a second intensity range, and switching the first face image in the user display area to a third face image, wherein the third face image is obtained based on the first face image.

14. The data processing method according to claim 1, further comprising: determining a first action value according to the sound intensity of the target sound signal, in response to obtaining an instruction for determining a user action value, wherein the first action value is used to characterize a matching degree between the sound intensity of the target sound signal and a standard intensity demanded by the target object; and determining the user action value according to the first action value.

15. The data processing method according to claim 1, further comprising: determining a second action value according to the first face image, in response to obtaining an instruction for determining a user action value, wherein the second action value is used to characterize a matching degree between an expression feature of the first face image and a standard feature demanded by the target object; and determining the user action value according to the second action value.

16. The data processing method according to claim 1, further comprising: determining a first action value according to the sound intensity of the target sound signal and determining a second action value according to the first face image, in response to obtaining an instruction for determining a user action value, wherein the first action value is used to characterize a matching degree between the sound intensity of the target sound signal and a standard intensity demanded by the target object, and the second action value is used to characterize a matching degree between an expression feature of the first face image and a standard feature demanded by the target object; and determining the user action value according to the first action value and the second action value.

17. The data processing method according to claim 14, further comprising: displaying the user action value on the screen of the terminal.

18. A non-transitory computer-readable medium storing a computer program thereon, wherein the program, when executed by a processing apparatus, causes the processing apparatus to: acquire a face image of a user as a first face image; display the first face image in a user display area of a screen of the terminal; acquire a sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal; and switch a display state of the target object and switch the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, wherein the second face image is obtained based on the first face image.

19. An electronic device, comprising: a storage apparatus storing a computer program thereon; and a processing apparatus for executing the computer program in the storage apparatus to: acquire a face image of a user as a first face image; display the first face image in a user display area of a screen of the terminal; acquire a sound signal of the user as a target sound signal, in a case that a target object is displayed on the screen of the terminal; and switch a display state of the target object and switch the first face image in the user display area to a second face image, in response to a sound intensity of the target sound signal within a first intensity range, wherein the second face image is obtained based on the first face image.

20. The electronic device according to claim 19, wherein a body image is displayed in the user display area; and the computer program, when executed by the processing apparatus, causes the processing apparatus to: determine position information of the user according to the first face image, wherein the position information is used to indicate an offset direction and an offset angle of the user relative to the terminal; determine the body image corresponding to the first face image according to the position information and pre-stored image material; and display the body image in the user display area.